Proceedings of Meetings on Acoustics

نویسنده

  • Xin Dang
چکیده

Estimation of the power spectral density (PSD) of noise is crucial for retrieving speech in a noisy environment. 3 novel methods for estimating the non-white noise PSD of noisy speech based on a generalized gamma distribution and 3 criterions are proposed, which are minimum mean square error (MMSE), maximum a posteriori (MAP) and Maximum likelihood estimation (MLE). Because of the highly non-stationary nature of speech, it is difficult to derive the real probability density function (PDF) using any modeling technique. On the other hand, segmental noise is more stationary and can be fitted more accurately by a generalized gamma PDF, which is a natural extension of the Gaussian modeling for the distribution of non-white components. The results show that non-white noise spectrum fit more accurately on the generalized gamma PDF with adaptive parameters than on a Gaussian distribution function. The reported generalized gamma PDF model shows the better performance in estimating noise PSD compared to minimum statistics, MMSE-based and MLE noise PSD estimation methods. The performance of the proposed noise estimation is good when it is integrated with the speech-enhancement technique, as demonstrated log error, segmental SNR and PESQ measures.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Some simulations of the effect of varying excitation parameters on the transients of reed instruments

Fabrice Silva, Vincent Debut, Philippe Guillemain, Jean Kergomard, Christophe Vergez. Some simulations of the effect of varying excitation parameters on the transients of reed instruments. 21st International Congress on Acoustics (ICA 2013), Jun 2013, Montréal, Canada. Proceedings of Meetings on Acoustics, Proceedings of Meetings on Acoustics, Vol. 19, 035058 (2013), pp.4aMU4, 2013, Internation...

متن کامل

The role of phonological alternation in speech production: evidence from Mandarin tone sandhi.

We investigate the role of phonological alternation during speech production in Mandarin using implicit priming, a paradigm in which participants respond faster to words in sets that are phonologically homogeneous than in sets that are phonologically heterogeneous. We test whether priming is obtained when words in a set share the same tones at the underlying level but have different tones at th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013